# Real-time Voice Conversation
MiniCPM-o 2.6
MiniCPM-o 2.6 is a GPT-4o-level multimodal large model that runs on mobile devices and supports vision, voice, and live-stream processing.
Multimodal Fusion
Transformers Other

openbmb
178.38k
1,117
MiniCPM-V 2.6
MiniCPM-V is a mobile GPT-4V-level multimodal large language model that supports single-image, multi-image, and video understanding, equipped with visual and optical character recognition capabilities.
Image-to-Text
Transformers Other

openbmb
91.52k
969